Is N-Best Dead?

نویسندگان

  • Long Nguyen
  • Richard M. Schwartz
  • Ying Zhao
  • George Zavaliagkos
چکیده

We developed a faster search algorithm that avoids the use of the N-Best paradigm until after more powerful knowledge sources have been used. We found, however, that there was little or no decrease in word errors. We then showed that the use of the N-Best paradigm is still essential for the use of still more powerful knowledge sources, and for several other purposes that are outlined in the paper. 1. I N T R O D U C T I O N The N-Best Paradigm [1] was introduced originally as a means for integrating the speech recognition and language understanding components of a spoken language system. Since then, we have generalized its use for integrating into the recognition search other expensive knowledge sources (such as higher-order n-gram language models, betweenword co-articulation models, and segmental models) without increasing the search space [2]. The basic idea is that we use inexpensive knowledge sources to find N alternative sentence hypotheses. Then we rescore each of these hypotheses with the more expensive and more accurate knowledge sources in order to determine the most likely utterance. The N-Best Paradigm specifically, and multi-pass search algofithms in general, are now used widely by the speech recognition research community. Besides its use as an efficient search strategy, the N-Best Paradigm has been used extensively in several other ways [2]. Its simplicity has made it ideal as a means for cooperation between research sites. For example, we regularly send the N-Best lists of alternatives to research sites that do not have an advanced speech recognition capability (e.g., Paramax and NYU) in order that they can apply their own linguistic components for understanding or for research into alternative language modeling techniques. Another related use of the N-Best lists is for evaluation of alternative knowledge sources. New knowledge sources can be evaluated without having to integrate them into the search strategy. For example, we can determine whether a new prosodic module or linguistic knowledge source reduces the error rate when used to reorder the N-Best list. This is particularly important for knowledge sources that are not easily formulated in a left-to-ddght incremental manner. Finally, we have presented techniques for optimizing the weights for different knowledge sources, and for discriminative training [2]. In this paper we attempt to determine whether the N-Best Paradigm results in substantial search errors. If it does, then its use for the other purposes mentioned above would also be questionable. First we describe briefly how we used the N-Best paradigm in previous versions of BYBLOS. Then, we descfibe our attempts to avoid the errors that might be a result of using the N-Best paradigm. Finally, we argue that there will always be cases where the N-Best paradigm will make it possible to use some knowledge sources that would likely never be used otherwise. 2. 3 P A S S N B E S T S E A R C H S T R A T E G Y The BYBLOS system has been described previously (e.g., [3]). We reiterate here the use of the N-Best Paradigm in that system. The decoder used a 3-pass search strategy. The strategy used a forward pass followed by a backward Word-Dependent NBest search algorithm [4] using a bigram language model, within-word triphone models, and top-1 (discrete VQ) densities. The N-Best hypotheses were then rescored using crossword triphone context models, top-5 mixture densities, and trigram language model. Typically, the backward Word-Dependent N-Best pass requires about half the time required by the forward pass. Rescoring each alternative sentence hypothesis individually with cross-word triphone models only requires about 0.2 seconds per hypothesis. And rescofing the text of the hypotheses with a high-order n-gram language model [5] requires essentially no time. 3. A D M I S S I B I L I T Y It has often been asserted that the N-Best paradigm is inadmissible because when the initial N-Best list is created using weaker knowledge sources, then the answer that would have had the highest score using the stronger knowledge sources might not be within the list of alternatives, and therefore never have a chance to be rescored. This would be especially likely when the error rate is high and the utterances

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Relationship between Dead Trees with Soil Physico-chemical Properties and Earthworm in Mixed Broad-leaved Forest Stand (Case study: Sarcheshmeh Forest, Chaloos)

Dead trees protection, has a key role in structural and biogeochemical processes in forest ecosystems. Some aspects of dead tree dynamics have been carefully studied, but the kind and decay degree of dead trees and forest soil properties have not received enough attention. The aim of this research was to study the effect of a kind and decay degree of dead trees on soil mineral properties in the...

متن کامل

A Case of Bilateral Agenesis of the Femur

Bilateral femoral agenesis is a rare anomaly.  To the best of our knowledge, only three cases of simple congenital anomaly and three cases associated with femoral facial syndrome have been reported.  Here, we describe a simple form of bilateral femoral agenesis observed in one of the 2 dead fetuses delivered after termination of a 24-week twin pregnancy of a normal mother.  Post-mortem x-ray ex...

متن کامل

Designing Robust Finite-Time Nonlinear Torques for a n-DOF Robot Manipulator with Uncertainties, Sector and Dead-Zone Input Nonlinearities

In this paper, a complete dynamical model is presented for an uncertain -DOF robot manipulator containing description of sector and dead-zone input nonlinearities. Next, robust finite-time tracking problem of desired trajectories is declared and formulated for the aforementioned robot manipulator. By defining innovative nonlinear sliding manifolds and developing the nonsingular terminal sliding...

متن کامل

Using Intelligent Methods and Optimization of the Existing Empirical Correlations for Iranian Dead Oil Viscosity

Numerous empirical correlations exist for the estimation of crude oil viscosities. Most of these correlations are not based on the experimental and field data from Iranian geological zone. In this study several well-known empirical correlations including Beal, Beggs, Glasso, Labedi, Schmidt, Alikhan and Naseri were optimized and refitted with the Iranian oil field data. The results showed that ...

متن کامل

BEST APPROXIMATION SETS IN -n-NORMED SPACE CORRESPONDING TO INTUITIONISTIC FUZZY n-NORMED LINEAR SPACE

The aim of this paper is to present the new and interesting notionof ascending family of  $alpha $−n-norms corresponding to an intuitionistic fuzzy nnormedlinear space. The notion of best aproximation sets in an  $alpha $−n-normedspace corresponding to an intuitionistic fuzzy n-normed linear space is alsodefined and several related results are obtained.

متن کامل

A typological view of possessive constructions in Sign Language of the Netherlands

ones (6b), as well as for kinship terms (6c, 6d) and part-whole relations with animate (6e) and inanimate possessors (6f). Juxtaposing nominal possessors and possessums in NGT then does not seem to be restricted to alienable or inalienable possession. [VanDale COACH] (6) a. DUTCH NATIONAL-TEAM COACH INDEX3 EVERYBODY ALWAYS CRITICISE3 'The coach of the Dutch national (soccer) team is always crit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994